A fast lattice-based approach to vocabulary independent wordspotting

نویسندگان

  • David A. James
  • Steve J. Young
چکیده

Practical applications of wordspotting, such as spoken message retrieval and browsing, require the ability to process large amounts of speech data at speeds many times faster than real-time. This paper presents a novel approach to this problem in which all of the stored audio material is prepro-cessed oo-line to generate a phoneme lattice. At search time, putative word matches are found in this lattice using symmetric dynamic programming. The paper presents the details of the algorithms used and compares performance with a number of conventional approaches using a 20 keyword vocabulary on the DARPA Resource Management Task. The results show that the proposed method is very much faster yet performs acceptably compared to conventional systems which depend on keyword-speciic training or prior knowledge of the test set vocabulary.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unrestricted Vocabulary Keyword Spotting Using LSTM-CTC

Keyword spotting (KWS) aims to detect predefined keywords in continuous speech. Recently, direct deep learning approaches have been used for KWS and achieved great success. However, these approaches mostly assume fixed keyword vocabulary and require significant retraining efforts if new keywords are to be detected. For unrestricted vocabulary, HMM based keywordfiller framework is still the main...

متن کامل

Hmm Based Fast Keyword Spotting Algorithm with No Garbage Modeliis

\ , I \ I I The problem Of discriminating keyword and non-keyword speech which is important in wordspotting applications is addressed here. We have shown that garbage models cannot reduce both rejection and false alarm rates simultaneously. Thus, the following relation becomes apparent from the above inequalities O'In I ) > " ( o ' l l P ) . P(08 I R P ) > p(oP I R 1 ) (4) TO achieve this we ha...

متن کامل

Phoneme based Spoken Document Retrieval

Since speech recognition technology has become more and more mature, retrieval of spoken documents has become a feasible task. We report about two cases, which aim at scalable and effective retrieval of broadcast recordings. The approach is based on a hybrid architecture, which combines the speed of off-line phoneme indexing and precision of wordspotting while maintaining a scalable architectur...

متن کامل

The Effect of Teaching Critical Reading Strategies on EFL Learners’ Vocabulary Retention

This study was an attempt to investigate whether teaching critical reading strategies had any significant effect on intermediate EFL learners’ vocabulary retention. To fulfill the purpose of this study, 72 male and female students within the age range of 17 to 32 years studying at Farzan and Farzanegan language schools in Tehran at intermediate level were selected from a total number of 114 par...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994